On Effectiveness of Database Accessing Methods for Subset Searching

نویسنده

  • Maciej Zakrzewicz
چکیده

The field of relational database research has developed many database accessing methods for effective data retrieval, e.g. B tree indexing, Bitmap indexing, hash-based join, sort-merge join. These methods are oriented on finding or joining single items (represented by records) that satisfy point or range conditions. However, in the area of data mining research, there is often the need to effectively retrieve the multi-item sets that contain a given multi-item subset. We will refer to this type of retrieval as Subset Search Problem. In this paper we formally define the Subset Search Problem. We analyze and experimentally verify the usefulness and effectiveness of the most common database accessing methods applied to the Subset Search Problem. The results show that Bitmap indexing gives the best improvement to the implementation of the Subset Search Problem.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance Evaluation of Parallel S-Trees

The S-tree is a dynamic height-balanced tree similar in structure to B + trees. S-trees store xed length bit-strings, which are called signatures. Signatures are used for indexing textbases, relational, object oriented and extensible databases as well as in data mining. In this article, methods of designing multi-disk B-trees are adapted to S-trees and new methods of parallelizing S-trees are d...

متن کامل

A Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems

Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...

متن کامل

A Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems

Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...

متن کامل

The Effectiveness of Preoperative Exercises on the Outcomes After Anterior Cruciate Ligament Reconstruction: A Systematic Review

Objective: Quadriceps weakness is common after Anterior Cruciate Ligament (ACL) injury and subsequent surgery. Preoperative defects affect postoperative outcomes. The purpose of this review study was to investigate whether preoperative exercises can affect the postoperative outcomes after ACL reconstruction. Methods: The searching for papers was conducted in the PubMed database among the studi...

متن کامل

Comparison of Bibliographic Databases in Retrieving Information on Telemedicine

Background & Aims: Some of the main questions which can be of importance for those researchers who intend to perform a systematic review in a field of science are: ‘What databases should I use for my review?’; ‘Do all these databases have the same value?’; and ‘Which sourcesretrieved the highest of relevant references?’. The main aim of this work was the identification of the best database for ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002